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Abstract. Univariate polynomials with only real roots - while special — do 
occur often enough that their properties can lead to interesting conclusions in 
diverse areas. Due mainly to the recent work of two young mathematicians, 
Julius Borcea and Petter Branden, a very successful multivariate generaliza- 
tion of this method has been developed. The first part of this paper surveys 
some of the main results of this theory of "multivariate stable" polynomials 
- the most central of these results is the characterization of linear transfor- 
mations preserving stability of polynomials. The second part presents various 
applications of this theory in complex analysis, matrix theory, probability and 
statistical mechanics, and combinatorics. 



1. Introduction. 

I have been asked by the AMS to survey the recent work of Julius Borcea and 
Petter Branden on their multivariate generalization of the theory of univariate poly- 
nomials with only real roots, and its applications. It is exciting work - elementary 
but subtle, and with spectacular consequences. Borcea and Branden take center 
stage but there are many other actors, many of whom I am unable to mention in 
this brief treatment. Notably, Leonid Gurvits provides a transparent proof of a 
vast generalization of the famous van der Waerden Conjecture. 

Space is limited and I have been advised to use "Bourbaki style", and so this 
is an account of the essentials of the theory and a few of its applications, with 
complete proofs as far as possible. Some relatively straightforward arguments have 
been left as exercises to engage the reader, and some more specialized topics are 
merely sketched or even omitted. For the full story and the history and context 
of the subject one must go to the references cited, the references they cite, and so 
on. The introduction of [1], in particular, gives a good account of the genesis of the 
theory. 

Here is a brief summary of the contents. Section 2 introduces stable polyno- 
mials, gives some examples, presents their elementary properties, and develops 
multivariate generalizations of two classical univariate results: the Hermite-Kakeya- 
Obreschkoff and Hermite-Biehler Theorems. We also state the Polya-Schur Theo- 
rem characterizing "multiplier sequences" , as this provides an inspiration for much 
of the multivariate theory. Section 3 restricts attention to multiaffine stable polyno- 
mials: we present a characterization of multiaffine real stable polynomials by means 
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of parameterized quadratic inequalities, and characterize those linear transforma- 
tions which take multiaffine stable polynomials to stable polynomials. In Section 4 
we use parts of the forgoing for Borcea and Branden's splendid proof of the Grace- 
Walsh-Szego Coincidence Theorem. In Section 5, the Grace- Walsh-Szego Theorem 
is used to extend the results of Section 3 from multiaffine to arbitrary stable polyno- 
mials. This culminates in an amazing multivariate generalization of the Polya-Schur 
Theorem, the proof of which requires the development of a multivariate extension 
of the Szasz Principle (which is omitted, regretfully, for lack of space). Section 6 
presents Borcea and Branden's resolution of some matrix-theoretic conjectures of 
Johnson. Section 7 presents the derivation by Borcea, Branden, and Liggett of neg- 
ative association inequalities for the symmetric exclusion process, a fundamental 
model in probability and statistical mechanics. Section 8 presents Gurvits's sweep- 
ing generalization of the van der Waerden Conjecture. Finally, Section 9 briefly 
mentions a few further topics that could not be included fully for lack of space. 

I thank Petter Branden kindly for his helpful comments on preliminary drafts of 
this paper. 

2. Stable polynomials. 

We use the following shorthand notation for multivariate polynomials. Let [m] = 
{1, 2, m}, let x — (cci , x m ) be a sequence of indeterminates, and let C[x] be the 
ring of complex polynomials in the indeterminates x. For a function a : [m] — ► N, 
let x Q = x" ■ ■ ■ ij m ' be the corresponding monomial. For S C [to] we also let 
x s = IliGS^i- Similarly, for i £ [to] let di = d/dxi, let d — {d\, ...,d m ), let 

d a = t?i ■* • • - dm and let d s — Yiies^- The cons t an t functions on [to] with 
images or 1 are denoted by and 1, respectively. The x indeterminates are always 
indexed by [to]. 

Let IK — {z £ C : Im(z) > 0} denote the open upper half of the complex plane, 
and IK the closure of IK in C. A polynomial / <G C[x] is stable provided that either 
/ = identically, or whenever z = (zi,...,z m ) £ IK m then /(z) ^ 0. We use 6[x] 
to denote the set of stable polynomials in C[x], and Sr[x] = S[x]dl[x] for the set 
of real stable polynomials in R[x]. (Borcea and Branden do not consider the zero 
polynomial to be stable, but I find the above convention more convenient.) 

We rely on the following essential fact at several points. 

Hurwitz's Theorem (Theorem 1.3.8 of [2]). Let f2 C C m be a connected open 
set, and let (/„ : n £ N) be a sequence of functions, each analytic and nonvanishing 
on fl, which converges to a limit f uniformly on compact subsets o/f2. Then f is 
either nonvanishing on or identically zero. 

Consequently, a polynomial obtained as the limit of a convergent sequence of 
stable polynomials is itself stable. 

2.1. Examples. 

Proposition 2.1 (Proposition 2.4 of ,1 ]). For i £ [m], let Ai be an n-by-n matrix 
and let Xi be an indeterminate, and let B be an n-by-n matrix. If Ai is positive 
semidefinite for all i £ [m] and B is Hermitian then 

/(x) = det(xiAi + x 2 A 2 + ■■■ + x m A m + B) 



MULTIVARIATE STABLE POLYNOMIALS 



3 



is real stable. 

Proof. Let / denote the coefficientwise complex conjugate of /. Since A, = Aj for 
all i £ [to], and B = B T , it follows that / = /, so that / £ K[x]. By Hurwitz's 
Theorem and a routine perturbation argument, it suffices to prove that / is stable 
when each Ai is positive definite. Consider any z = a + ib £ Ji m , with a, b £ M m 
and 6j > for all i £ [to] (abbreviated to b > 0). Now Q — YllLi ^iM is positive 
definite, and hence has a positive definite square-root Q 1 ! 2 . Also note that H = 
Y^iLi a-iM + B is Hcrmitian, and that 

/(z) = det(Q) det(i/ + Q~ 1/2 HQ~ 1/2 ). 

Since det(Q) ^ 0, if /(z) = then — i is an eigenvalue of Q~ 1 / 2 HQ~ 1 / 2 , contradict- 
ing the fact that this matrix is Hcrmitian. Thus, /(z) ^ for all z £ J{ m . That is, 
/ is stable. □ 

Corollary 2.2. Let Q be an n-by-m complex matrix, and let X = diag(xi, ...,x m ) 
be a diagonal matrix of indeterminates. Then /(x) = dct(QXQ^) is real stable. 

Proof. Let Q = (qij), and for j £ [to] let Aj denote the n-by-n matrix with /w-th 
entry QhjQij- That is, Aj = QjQ^ in which Qj denotes the j-th column of Q. Since 
each Aj is positive semidefinite and QXQ^ = X\A\ + • • • + x m A m , the conclusion 
follows directly from Proposition 2.1. □ 

2.2. Elementary properties. The following simple observation often allows mul- 
tivariate problems to be reduced to univariate ones, as will be seen. 

Lemma 2.3. A polynomial f £ C[x] is stable if and only if for all a, b £ R m with 
b > 0, /(a + bt) is stable in &[t\. 

Proof. Since % m = {a + bt : a, b £ R m , b > 0, and t £ the result follows. □ 

For / £ C[x] and i £ [to], let degj(/) denote the degree of Xi in /. 

Lemma 2.4. These operations preserve stability of polynomials in C[x]. 

(a) Permutation: for any permutation a : [to] — > [to], / i— > /(avm, ... 1 x a ^ m \ l ). 

(b) Scaling: for c £ C and a £ R m with a > 0, / i— > cf{a\X\, . . . , a m x m ). 

(c) Diagonalization: for {i,j} C [to], / /(x)^—^.. 

(d) Specialization: for a £ "K, f i— ► /(a, ^2, • • ■ , x m ). 

(e) Inversion: i/deg^/) = d, f ^ xff(-x^ 1 ,x 2 , ■ ■ -,x m ). 

(f) Differentiation (or "Contraction"): f <9i/(x) . 

Proof. Parts (a,b,c) are clear. Part (d) is also clear in the case that Im(a) > 0. For 
a €E K apply part (d) with values in the sequence (a+i2~" : n £ N), and then apply 
Hurwitz's Theorem to the limit as n — > oo. Part (e) follows from the fact that 'K is 
invariant under the operation z — z^ 1 . For part (f), let d = deg 1 (/), and consider 
the sequence /„ = n~ d f(nxi,X2, ■ ■ ■ ,x m ) for all n > 1. Each /„ is stable and the 
sequence converges to a polynomial, so the limit is stable. Since deg 1 (/) = d, this 
limit is not identically zero. This implies that for all z 2 , z m £ 'K, the polynomial 
g{x) = f(x,Z2,.-.,z m ) £ C[x] has degree d. Clearly g'(x) — d\f(x, z 2 , z m ). Let 
£i, ■■■,£d be the roots of g(x), so that g(x) — c\\ h=1 (x — for some c £ C. Since 
/ is stable, Im(^) < for all h £ [d]. Now 

g'(x) d , . . 1 

= — log gf(x = > — . 
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If lm(z) > then Im(l/(z - £ h )) < for all h £ [d], so that g'(z) ^ 0. Thus, if 
z S M m then 5i/(z) ^ 0. That is, 9 X / is stable. □ 

Of course, by permutation, parts (d,e,f) of Lemma 2.4 apply for any index i £ [to] 
as well (not just Part (f) is essentially the Gauss-Lucas Theorem: the roots 

of g'(x) lie in the convex hull of the roots of g(x). 

2.3. Univariate stable polynomials. A nonzero univariate polynomial is real 
stable if and only if it has only real roots. Let / and g be two such polynomials, 
let £i < £2 < ■ • • < £fc be the roots of /, and let Q\ < 62 < • ■ • < 9i be the roots of 
g. These roots are interlaced if they are ordered so that £1 < 0\ < £2 < #2 < ■ ■ ■ or 
9i < £1 < 6 2 < £2 < ■ ■ ■ ■ For each i £ [£], let = g/(x - 6>j). If deg / < degg and 
the roots of g are simple, then there is a unique (a, bi, ... , bi) £ R^ +1 such that 

/ = ag + bigi H h b e g e . 

Exercise 2.5. Let /, g G 6m [x] be nonzero and such that fg has only simple roots, 
let deg / < deg g, and let 6\ < ■ ■ ■ < &e be the roots of g. The following are 
equivalent: 

(a) The roots of / and g are interlaced. 

(b) The sequence K62), ■ ■ ■ , f{0i) alternates in sign (strictly). 

(c) In / — ag + bigi, all of 61, ... , bi have the same sign (and are nonzero). 

The Wronskian of /, g e C[x] is W[/, g] = f ■ g - f ■ g' '. If / = ag + Y?i=i b i9i as 
in Exercise 2.5 then 

W[/,g] = d / A = -6j 

It follows that if / and 5 are as in Exercise 2.5(a) then W[f,g] is either positive 
for all real x, or negative for all real x. Since W[g, f] — —W[f,g] the condition 
that deg / < deg g is immaterial. Any pair /, g with interlacing roots can be 
approximated arbitrarily closely by such a pair with all roots of fg simple. It 
follows that for any pair /, g with interlacing roots, the Wronskian W[/, g] is cither 
nonnegative on all of R or nonpositive on all of R. 

Nonzero univariate polynomials /, g £ &^[x] are in proper position, denoted by 
/ <C g, if W[/, g] < on all of R. For convenience we also let -C / and / -C for 
any / £ 6m [x); in particular <§; 0. 

Exercise 2.6. Let f,g£ &r[x] be real stable. Then / -c g and g <C / if and only 
if cf = dg for some c, c? € R not both zero. 

Hermite-Kakeya-Obreschkoff (HKO) Theorem (Theorem 6.3.8 of [14 ). Let 

f,g £ M[x\. Then af + bg £ 6r[x] for all a, b £ R if and only if f,g £ &m[x] and 
either f <C g or g -C /. 

Hermite-Biehler (HB) Theorem (Theorem 6.3.4 of [3]). Let f,g £ R[x]. 
Then g + if £ &[x] if and only if f,g £ &m[x] and f <S g. 

Proofs of HKO and HB. It suffices to prove these when fg has only simple roots. 

For HKO we can assume that deg(/) < deg(g). Exercise 2.5 shows that if the 
roots of / and g are interlaced then for all a, b £ R, the roots of g and af + bg 
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arc interlaced, so that af + bg is real stable. The converse is trivial if cf = dg for 
some c, d G M not both zero, so assume otherwise. From the hypothesis, both / 
and g are real stable. If there are zq,z\ G K for which Im(/(zo)/g(zo)) < and 
lm(f(zi)/g(zi)) > 0, then for some A G [0, 1] the number z\ — (1 — A)zo+Azi is such 
that lm(f(z\)/g(z\)) — 0. Thus f(z\) — ag(z\) = for some real number a G R. 
Since / — ag is stable (by hypothesis) and z\ G K, this implies that / — ag = 0, a 
contradiction. Thus Im(/(z)/g(z)) does not change sign for z G "X . This implies 
Exercise 2.5(c): all the hi have the same sign (consider f/g at the points 9i +ie for 
e > approaching 0). Thus, the roots of / and g are interlaced. 

For HB, let p — g + if. Considering ip = —f + ig if necessary, we can assume 
that deg/ < degg. If / <C g then Exercise 2.5(c) implies that lm(f (z) / g(z)) < 
for all z G K, so that g + if is stable. For the converse, let p(x) = cY\ i=1 (x — £;), 
so that Im(£i) < for all i £ [d]. Now \z - &| > \z - ^| for all z e M and % G [d], 
so that > |p(^)| for all z e J{. For any z G IK with /(z) ^Owe have 



f(z) 



> 



9{z) 



and it follows that Im(<?(z)//(z)) > for all z G !H with /(z) ^ 0. Since fg has 
simple roots it follows that g(x) + yf(x) is stable in &[x, y]. By contraction and 
specialization, both / and g arc real stable. By scaling and specialization, af + bg 
is stable for all a, b 6 R. By HKO, the roots of / and g are interlaced. Since 
lm(f(z)/g(z)) < for all z G IK, all the bi in Exercise 2.5(c) are positive, so that 
W[/, g) is negative on all of K: that is / -C g- □ 



For A : N — > E, let T\ : R[x] — ► R[cc] be the linear transformation defined by 
T\(x n ) — X(n)x n and linear extension. A multiplier sequence (of the first kind) is 
such a A for which T\(f) is real stable whenever / is real stable. Polya and Schur 
characterized multiplier sequences as follows. 

Polya-Schur Theorem (Theorem 1.7 of [4 ]). Let A : N — * R. The following are 
equivalent: 

(a) A is a multiplier sequence. 

(b) F\(x) = X^^Lo A(n)a; n /n! is an entire function which is the limit, uniformly on 
compact sets, of real stable polynomials with all roots of the same sign. 

(c) Either F\(x) or F\(—x) has the form 

oo 

Cx n e ax Y[(l + a jX ) 7 
i=i 

in which C G R, n G N, a > 0, all Uj > 0, and XjLi a j * s finite- 

(d) For a// rt G N, f/ie polynomial + x)™) is reaZ stable with all roots of the 
same sign. 



One of the main results of Borcea and Branden's theory is a great generalization 
of the Polya-Schur Theorem - a characterization of all stability preservers: linear 
transformations T : C[x] — > C[x] such that T(/) is stable whenever / is stable. 
(Also the analogous characterization of real stability preservers.) This is discussed 
in some detail in Section 5.3. 
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2.4. Multivariate analogues of the HKO and HB Theorems. By analogy 
with the univariate HB Theorem, polynomials f,g £ K[x] are said to be in proper 
position, denoted by / <C g, when g + if £ S[x]. (As will be seen, this implies that 
f,g £ 6r [x].) Thus, the multivariate analogue of Hermite-Biehler is a definition , 
not a theorem. 

Proposition 2.7 (Lemma 1.8 and Remark 1.3 of |5 ). Let f,g G C[x]. 

(a) If f,g G R[x] i/ien f < g if and only if g + yf G 6 R [x, y] . 

(b) 7/ ^ / G 6[x] tfien y + y/ G 6[x, y] i/ and only if for all z G 

-08) « 

Proof. If g + yf £ Sk[x, y] then g + if G S[x], by specialization. Conversely, 
assume that h = g + if £ 6[x] with f,g£ K[x], and let z = a + io with a, 6 G M 
and 6 > 0. By Lemma 2.3, for all a, b G W n with b > we have ft(a + bt) 6 6|. 
By HB, /(<) = /(a + bt) and 5(f) = gr(a + bt) are such that 7 < By HKO, 
cf + dg G Sr[7] for all c,deM. By HKO again, the roots of 6/ and of g + af are 
interlaced. Since W[bf,g + af] = bW[f,g] < on M, it follows that 6/ < 5 + a/. 
Finally, by HB again, y + (a + ib)f G 6[t]. Since this holds for all a, b G M m with 
b > 0, Lemma 2.3 implies that g + (a + ib)f G 6[x]. Since this holds for all a, b G M 
with b > 0, g + yf £ 6[x, y]. This proves part (a). 

For part (b), first let g + yf be stable. By specialization, y is also stable. If g = 
then there is nothing to prove. Otherwise, consider any z G IK™, so that /(z) 7^ 
and g(z) ^ 0. There is a unique solution z G C to <?(z) + z/(z) = 0, and since 
g + yf is stable, Im(z) < 0. Hence, lm(g(z) / f(z)) = Im(— z) > 0. This argument 
can be reversed to prove the converse implication. □ 

Exercise 2.8 (Corollary 2.4 of [4\). 6[x] = {g + if : f,g G 6 R [x] and / < g}. 

Here is the multivariate HKO Theorem of Borcea and Branden. 

Theorem 2.9 (Theorem 1.6 of [4 ). Let f,g G E[x]. Then af + bg G 6 R [x] /or a// 
a, & G K i/ and onZy if f, g G 6r[x] and either f <C .0 or g <C / • 

Proof. First assume that / <C g, and let 8,1) £ R with 6 > 0. By Proposition 
2.7(a), g + yf £ Sr[x,?/]. By scaling and specialization, 05 + (a + i)/ G S[x]. By 
Proposition 2.7(a) again, / -C (a/ + 65). Thus af + bg £ Sr[x] for all a, 6 G M. 
The case that g -C / is similar. 

Conversely, assume that af + bg £ 6r[x] for all a, 6 G K. Let a, b G K m with 
b > 0, and let f(t) = /(a+bt) and g(t) = #(a+bt). By Lemma 2.3, a/+6y G 6 R [t] 
for all o,kR. By HKO, for each a, b G R m with b > 0, either / < g or g < /. 

If / < 3 for all a, b G M m with b > 0, then by HB, <?+i7G G[t] for all a, b G E m 
with b > 0. Thus g + if £ S[x] by Lemma 2.3, which is to say that / C5 (by 
definition). Similarly, ifg<^ f for all a, b G M. m with b > then g <C /. 

It remains to consider the case that /(a<j + bot) <C <7(ao + bot) for some ao, bo G 
R m with b > 0, and g(a.i + bit) < /(ai + bit) for another ai,bi G R m with 
bi > 0. For < A < 1, let a* = (1 - A)a + Aai and b\ = (1 - A)b + Abi. Since 
roots of polynomials move continuously as the coefficients are varied continuously, 
there is a value < A < 1 for which both f(a x + b\t) <C g(a\ + b\t) and g(a\ + 
b>i) *C /( a A + bA^)- From Exercise 2.6, it follows that c/(a A +b A t) = dg(a x + b\t) 
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for some c,rf£l not both zero. Now h — cf — dg £ 6[x] by hypothesis, and since 
h(a.x + b\t) = identically it follows that h(a\ + ih\) — 0. Since > and h is 
stable, this implies that h = 0, so that cf = dg in 6[x]. In this case, both / <C g 
and g < / hold. □ 

For f,g£ C[x] and i £ [m], let W* [/, g] = t?i/ ■ g — f • dig be the i-i/i Wronskian 
of the pair (/, g). 

Corollary 2.10 (Theorem 1.9 of [5]). Lei f,g£ R[x]. T/ie following are equivalent: 

(a) g + if is stable in S[x], £/ia£ is f ^ g; 

(b) g + yf is real stable in @r [x, y]; 

(c) af + bg£ 6r [x] /or alla,b gR, and Wj [/, g] (a) < /or a// i G [to] and a £ K m . 

Proof. Proposition 2.7(a) shows that (a) and (b) are equivalent. 

If (a) holds then Theorem 2.9 implies that af + bg £ Sr[x] for all a, b £ E. To 
prove the rest of (c), let i € [m] and a G K m , and let Si £ M m be the unit vector 
with a one in the ?-th position. Since / <C g, for any b £ M. m with b > we have 
/(a + (b + 5i)t) <C ff(a + (b + in Sr[4], from Proposition 2.7(a) and Lemma 
2.3. By the Wronskian condition for univariate polynomials in proper position, 

W[/(a + (b + 5i)t), ff (a + (b + Si)t)] < 

for all tel. Taking the limit as b —> and evaluating at t — yields 

Wi[/, fl ](a) = W[/(a+^) )9 (a + ^)]| t= o < 0, 

by continuity. Thus (a) implies (c). 

To prove that (c) implies (b), let a, b £ M. m with b = (6 1 ,...,6 m ) > 0, and let 
a,b £ M. with b > 0. By Lemma 2.3, to show that g + yf £ 6r[x, y) it suffices to 
show that g(& + hi) + (a + i6)/(a + ht) £ ©[t]. From (c) it follows that p = g + af 
and q = bf are such that ap+ j3q £ ©r[x] for all a, (3 £ R. By Theorem 2.9, either 
p <C q or q <C P- Now 

W[g(a + bi),p(a+b<)] = 6W[/(a + bt), g(a + bt)] 

m 

= bJ2hWi[f,g](si + bt)<0, 

i=l 

by the Wronskian condition in part (c). Thus q(a + ht) -C p(a + ht), so that 
p(a + ht) +iq(a + ht) £ ©[<]. Since p + iq = q+(a + ib)f, this shows that (c) implies 
(b). ~ ' □ 

Exercise 2.11 (Corollary 1.10 of [5 ). Let f,g £ 6 R [x] be real stable. Then / <S g 
and g <C / if and only if cf = dg for some c, d £ K not both zero. 

Proposition 2.12 (Lemma 3.2 of [5])- Let V be a K-vector subspace o/K[x], with 
either K = R or K = C. 

(a) 7/K = R and V C 6 R [x] t/ien dim R V" < 2. 

(b) IfK = C anrf V C 6[x] t/ien dim c V < 1. 

Proof. For part (a), suppose to the contrary that f,g,h £ V are linearly indepen- 
dent over K (and hence not identically zero). By Theorem 2.9, either / <C <? or 
,o <C /, and similarly for the other pairs {/, /i} and {g, h}. Renaming these polyno- 
mials as necessary, we may assume that / <C h and h <C 5. Now, for all A € [0, 1] let 
Pa = (1 — A)/ + Ag, and note that each p> ^ 0. By Theorem 2.9, for each A £ [0, 1] 
either h <C Pa or Pa *C ^- Since p = / -C /i and /i -C .0 = pi, by continuity of 
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the roots of {p\ : A £ [0,1]} there is a A € [0, 1] such that h -C p\ and p\ <C h. 
But then, by Exercise 2.11, either {/, g} is linearly dependent or h is in the span 
of {/, g}, contradicting the supposition. 

For part (b), let Re(V) — {Re(h) : h £ V}. Then Re(V) is a real subspace of 
6r[x], so that diniRRe(y) < 2 by part (a). If diiriRRe(V) < 1 then dime V < 1. 
In the remaining case let {p, q} be a basis of Re(V) with / = p + iq £ V . By 
Corollary 2.10, Wi[q,p](a) < for all i £ [m] and a £ E m . Since p and q are not 
linearly dependent, there is an index k £ [m] such that Wk[q,p] ^ 0. 

Consider any g £ V. There are reals a,b,c,d € E such that 

g = (ap + bq) + i(cp + dq). 

Since g is stable, Wk[cp + dq, ap + bq](a) = (ad — 6c)Wfc[g,p](a) < for all a £ E m . 
Since Wk[q,p) ^ 0, it follows that ad—bc> 0. Now, for any v,w £ E, g+(v + iw)f 
is in 1^. Since 

g + (v + \w)f = (a + v)p + (b — w)q + i((c + w)p + (d + v)q), 

this argument shows that H = (a + v)(d + v) — (b — w)(c + w) > for all v, w £ E. 
But 

AH = (2v + a + d) 2 + (2w + c- b) 2 - (a - d) 2 - (b + c) 2 , 

so that H > for all v, w £ E if and only if a = d and b = — c. This implies that 
g = (a + ic)/, so that dime V = 1. □ 

3. MULTIAFFINE STABLE POLYNOMIALS. 

A polynomial / is multiaffine if each indeterminate occurs at most to the first 
power in /. For a set § of polynomials, let S MA denote the set of multiaffine 
polynomials in S. For multiaffine / £ C[x] MA and i £ [m] we use the "ultra- 
shorthand" notation / = f l + Xif. L in which /' = /| Kj =o and ft — dif '. This 
notation is extended to multiple distinct indices in the obvious way in particular, 

3.1. A criterion for real stability. For / £ C[x] and {i, j} C [m], let 

A,,/ - dif ■ 0,f - f ■ didjf. 
Notice that for / £ C[x] MA , 

A y / = fib - ft fa = Vfitfjj] = -Vfi[fj,f s ], 

and 

A y / = fl fj — f 3 fij- 

Theorem 3.1 (Theorem 5.6 of 8 and Theorem 3 of US]). Let / £ E[x] MA be 
multiaffine. The following are equivalent: 

(a) / is real stable. 

(b) For all {i,j} C [m] and all a £ E m , A 2J /(a) > 0. 

(c) Either m = 1, or f/iere exists {i,j} C [m] smc/i </iai /j, / l , /j and / J are reaZ 
siaWe, anrf A lJ /(a) > /or a// a e E m . 
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Proof. To see that (a) implies (b), fix {i,j} C [to]. Proposition 2.7(a) shows 
that fj <C p , and from the calculation above and Corollary 2.10, it follows that 
Ay/fa) = -Wj (a) > for all a e R m . 

We show that (b) implies (a) by induction on to, the base case m — 1 being 
trivial. For the induction step let / be as in part (b), let a € R, and let g — 
f\x m =a- For all {i,j} C [to - 1] and a e R m_1 , A y5 (a) = Ay/(a,o) > 0. By 
induction, g = / m + o/ m is real stable for all a G R; it follows that af m + bf m G 
6k[^i ; ■ ■ • i %m-i] for all a, b € K. Furthermore, for all j G [to — 1] and a e R m_1 , 
W 3 [/ m ,/ m ](a) = -A jm /(a,l) < 0. This verifies condition (c) of Corollary 2.10 
for the pair (/ m , f m ), and it follows that / = f m + x m f m € 6r[x], completing the 
induction. 

It is clear that (a) and (b) imply (c) - we show that (c) implies (b) below. This 
is clear if to < 2, so assume that m > 3. To begin with, let {h,i,j} C [m] be 
three distinct indices, and consider Aj,/ as a polynomial in rr/j. That is, Ay / = 
A hijX 2 h + B hij x h + C hi: j in which 

-^hij = fhi-fhj ~ fhfhij = Ajj/fo, 

Bhij = fll f ~ fH 1 % + j f J fij - 1 :hi > J 'hij, and 

^hij — h J j ~ / foj — ^ijJ ■ 

If Ajj/(a) > for all a £ R m then this quadratic polynomial in Xh'- 

A,j/(ai, . . . , a,h-i,Xh, fflh+i, • • • , a m ) 

has a nonpositive discriminant for all a e R m . That is, Dhij = B\ { - — AAhijChij is 
such that D hij (a) < for all aeT. 

It is a surprising fact that as a polynomial in {xk ■ k G [m] \ {h, i,j}}, Dhij is 
invariant under all six permutations of its indices, as is seen by direct calculation: 

D hij = (/^/5) 2 + (/^4-) 2 + (/f/^) 2 + (/^-/^) 2 

n( fij fh fhj fi , r/ij j-i ihii j , rhi fj fij fh\ 
L \Jh JijJi Jhj^Ji JhjJj "'hi'Jj JhiJhJij) 

+^Ajfij hij +±fifffYhii- 

Now for the proof that (c) implies (b) when to > 3. Consider any h G [to] \{i, j}. 
Then 

Aw/ = AjhiX 2 j + BjhiXj + Cjhi 

has discriminant Djhi — Dhij. Since fj and f^ are real stable, we have Ajhi(a) = 
Afci/j^a) > and C J -/ li (a) = A hi f j (a) > for all a e M m . Since A ij /(a) > for 
all a G R m it follows that D jhi (a) = D hl ,(a) < for all a e R m . It follows that 
A w /(a) > for all a e R m . (Note that if B 2 - AAC < and cither A = or 
C = 0, then £> = 0.) A similar argument using the fact that fi and /* are real 
stable shows that A hj f(a) > for all a e M m . 

It remains to show that A^fc/(a) > for all a e R m when {h, k} is disjoint from 
We have seen that A/ l ,/(a) > for all a S K m , and we know that both 
and / l are real stable. The argument above applies once more: A^/(a) > for all 
a e K m , so that D ihk (a) = D khi (a) < for all a e M m , and then since A lh k{&) > 
and C ihk {a) > for all a e M m it follows that A hk f(a) > for all a e M m . Thus 
(c) implies (b). □ 
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3.2. Linear transformations preserving stability multiaffine case. 

Lemma 3.2 (Lieb-Sokal Lemma, Lemma 2.1 of [5])- Let g(x) + y/(x) £ 6[x,?/] be 
stable and such that deg^f) < 1. Then g — dif £ S[x] is stable. 

Proof. Since g is stable (by specialization to y = 0), there is nothing to prove if 
dif = identically, so assume otherwise (and hence that / ^ 0). By permutation 
we can assume that i = 1. Since / is stable and z\, z £ !K imply that zi — z^ 1 £ !K, 
it follows that 

y.f{xi -y~ 1 ,x 2 ,-.,x m ) = -<9i/(x) +y/(x) 
is stable. Proposition 2.7(b) implies that for all z £ IH m , 

Im f g W-y W> i = Im + lm f=4P 1 > 0. 



/(z) ; v/w; V M 

Thus, by Proposition 2.7(b) again, g — dif + yf is stable. Specializing to y = 
shows that g — dif is stable. □ 

Exercise 3.3 (Lemma 3.1 of [5]). Let / £ C[x] MA and w £ Jf m Then for all e > 
sufficiently small, (x + w)[™l + e/(x) is stable. (Here (x + w)I m l = n™=i( x i + w i)-) 

For a linear transformation T : C[x] MA — > C[x] of multiaffine polynomials, define 
the algebraic symbol of T to be the polynomial 

T((x + y)W)=T(f[( Xt + yi ) \ = J2 nx S )y [mNS 

\i=l ) SC[m] 

in C[xi, . . .,x m ,yx, . . .,y m ] = C[x,y]. 

Theorem 3.4 (Theorem 1.1 of 5 ). Let T : C[x] MA -> C[x] be a linear transfor- 
mation. Then T maps S[x] MA into 6[x] if and only if either 

(a) T(f) = i]{f) ■ p for some linear functional i] : C[x] MA — > C and p £ S[x], or 

(b) the polynomial T((n + y)^) is stable in ©[x,y]. 

Proof. First, assume (b) that T((x + y)[™]) £ S[x,y] is stable. By inversion, it 
follows that y[" l lT((x - y _1 )M) i s also stable. Thus, if / £ 6[wi, w m ] is stable 
then 

y[ m ]T((x-y- 1 )[ m ])/(w)= ^ T(x s )(-y) s /(w) 

SC [m] 

is stable. If / is also multiaffine then repeated application of the Lieb-Sokal Lemma 
3.2 (replacing yi by —d/dwi for i £ [m]) shows that 

£ n**)^/<w) 

SC[m] 

is stable. Finally, specializing to w — shows that T(/(x)) is stable. Thus, the 
linear transformation T maps 6[x] MA into S[x]. This is clearly also the case if (a) 
holds. 

Conversely, assume that T maps 6[x] MA into 6[x]. Then for any w £ 'K m , 
(x + w)I m l £ e[x] MA , so that T((x + w)[ m l) £ 6[x]. 

First, assume that there is a w £ "K m for which T((x + w)I m l) = identically. 
For any / £ C[x] MA let e > be as in Exercise 3.3. Then eT(f) = T((x+w)[" l l +ef) 
is stable, so that T(f) is stable. Thus, the image of C[x] MA under T is a C-subspace 
of 6[x]. By Proposition 2.12(b), T has the form of case (a). 
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Secondly, if T((x + w)'" 1 ') ^ for all w £ l K m then, since each of these polyno- 
mials is in 6[x], we have T((x + w)[" l l) | x=z ^ for all z e M m and w £ Jf m . This 
shows that T((x + y)^ m ')) is stable in S[x, y], which is the form of case (b). □ 

Theorem 3.4 has a corresponding real form - the proof is completely analogous. 

Theorem 3.5 (Theorem 1.2 of 5 j). Let T : M[x] MA -> R[x] be a linear transfor- 
mation. Then T maps 6r[x] ma into 6r[x] if and only if either 

(a) T(f) = f)(f) ■ p + £(/) ■ q for some linear functionals Tj, £ : M[x] MA — ► K and 
p, q £ 6r[x] such that p <C g, or 

(b) i/ie polynomial T((x + y)^ m ') is reaZ stable in 6f[x,y], or 

(c) £/ie polynomial T((x — y)'™]) is real stable in ©g[x,y]. 

Proof. Exercise 3.6. □ 

4. The Grace- Walsh-Szego Coincidence Theorem. 

Let / £ C[x] be a univariate polynomial of degree at most to, and let x = 
(xi, ...,x m ) as usual. For < j < to, the j-th elementary symmetric function of x 
is 

e j( x ) = y , -"il ' ' ' X ij = ^ ] x ■ 

l<ii<— <ij<m SC[m]: \S\=j 

The m-th polarization of / is the polynomial obtained as the image of / under the 
linear transformation Pol m defined by x^ h- > (™) (x) for all < j < m, and lin- 
ear extension. In other words, Pol m / is the unique multiaffine polynomial in C[x] MA 
that is invariant under all permutations of [to] and such that Pol m /(x, ...,x) = f(x). 
A circular region is a nonempty subset A of C that is either open or closed, and 
which is bounded by either a circle or a straight line. 

Theorem 4.1 (Grace- Walsh-Szego, Theorem 3.4.1b of [14]). Let f e C[x] have 
degree at most m and let A be a circular region. Lf either deg(/) = to or A is 
convex, then for every z £ A m there exists z £ A such that Pol m /(z) = f(z). 

Figure 1 illustrates the Grace- Walsh-Szego (GWS) Theorem for the polynomial 
f(x) = x 5 + 10x 2 + 1. The black dots mark the solutions to f(x) — 0. Any 
permutation of the red (grey) dots is a solution to Pol5/(xi, afs) = 0. By GWS, 
any circular region containing all the red dots must contain at least one of the black 
dots. The figure indicates the boundaries of several circular regions for which this 
condition is met. 

The proof of GWS in this section is adapted from Borcea and Branden [B] . 

4.1. Reduction to the case of stable polynomials. First of all, it suffices to 
prove GWS for open circular regions, since a closed circular region is the intersection 
of all the open circular regions which contain it. Second, it suffices to show that 
for any g £ C[x] of degree at most to, if deg(g) = to or A is convex, and z £ A m is 
such that Pol m <7(z) — 0, then there exists z £ A such that g(z) = 0. This implies 
the stated form of GWS by applying this special case to g(x) — f(x) — c, where 
c = Pol m /(z). Stated otherwise, it suffices to show that if f(z) ^ for all z £ A 
then Pol m /(z) for all z £ A rn (provided that either deg(/) = m or A is convex). 

Let M be the set of Mobius transformations z i— > cj)(z) — (az + b)/(cz + d) with 
a, 6, c, d £ C and ab~cd — ±1. Then M with the operation of functional composition 
is a group of conformal transformations of the Riemann sphere C = C U {oo}, 
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Figure 1. Illustration of the Grace- Walsh-Szego Theorem. 

and it acts simply transitively on the set of all ordered triples of distinct points 
of C. Consequently, for any open circular region A there is a <f> £ M such that 
</>(!H) = {4>(z) : z £ %] = A. We henceforth regard circular regions as subsets of 
C. Note that an open circular region A is convex if and only if it does not contain 
oo. (The point oo is on the boundary of any open half-plane.) In this case, if 
<fi(z) = (az + b)/ (cz + d) is such that </>(3"C) = A then cz + d ^ for all zeM. 

Given ^ / £ C[i] of degree at most to, consider the polynomial f(x) = 
(ex + d) m f((ax + b)/(cx + d)). If either deg(/) — m or A is convex, then / is 
nonvanishing on A if and only if f(z) is nonvanishing on "K. Also, 

m 

Pol m /(x) =Pol m f((f>(xi) : ...,(f>(x m )) -Y[(cxi + d). 

i=l 

Thus, to prove GWS it suffices to prove the following lemma. 

Lemma 4.2. Let f £ C[x] be a univariate polynomial of degree at most m. Then 
Pol m / is stable if and only if f is stable. 

Clearly, diagonalization implies that if Pol m / is stable then / is stable, so only the 
converse implication needs proof. This is accomplished in the following two easy 
steps. 
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4.2. Partial symmetrization. The group §(m) of all permutations a : [m] — > [m] 
acts on C[x] by the rule a(f)(xi, ...,x m ) — f(x a ^, £ CT ( m )). Notice that 

m m 

a(x«) = n< ( ( ; ) ) = n a; r ff " lw =x— _1 . 

i=l i=l 

For {i, j} C [m], let r,j be the transposition that exchanges i and j and fixes all 
other elements of [to]. 

Lemma 4.3. Let < A < 1 and {i,j} C [to], and tet 7^ A) = (1 — A) + Xr tj . 7/ 
/ G 6[x] MA is stable and multiaffine then f £ 6[x] MA is stable and multiaffine. 

Proof. If / is multiaffine then (1 — A)/ + Xrij(f) is also multiaffine. We apply 
Theorem 3.4 to show that T = T^. preserves stability of multiaffine polynomials. 
By permutation we can assume that {i,j} = {1, 2}. The algebraic symbol of T is 

m 

T((x + y)H) = + j, l)(a;2 + y 2 )) . J]^. + yt) . 

i=3 

Clearly, this is stable if and only if the same is true of T((xi + yi)(x 2 + y 2 )). Exercise 
4.4 completes the proof. □ 

Exercise 4.4. Use the results of Sections 2.4 or 3.1 to show that for < A < 1, 
the polynomial 

X\X 2 + ((1 - \)x 1 + Xx 2 )y 2 + (Xx 1 + (1 - X)x 2 )y 1 + y Y y 2 

is real stable. 

4.3. Convergence to the polarization. Let =t f(x) £ 6[x] be a univariate 
stable polynomial of degree at most to: say f(x) = c(x — £i) • • • {x — £ n ) in which 
c ^ 0, n < m, and & ^ "K for all i £ [n]. Then the polynomial F £ C[x] defined by 

F (a;i, ...,x m ) = c(x! - £i) ■ ■ ■ (x n - £„) 

is multiaffine and stable, and F (x,...,x) — f(x). Let £ = ({ik,jk} ■ k £ N) 
be a sequence of two-clement subsets of [to], and for each k £ N let T k = T^j^ 
and define F k +i = Tfc(Ffc). By induction using Lemma 4.3, each F k £ 6[x] MA is 
multiaffine and stable, and F k (x, x) — f(x) for all k £ N. We will construct such 
a sequence £ for which (P& : k £ N) converges to Pol m /. 

Let P £ C[x] MA be multiaffine, say P(x) = £ 5C[m] c(5)x s '. For {i, j} C [to] let 

^(P)= £ |c(S)-c(^(S))| 

SC[m] 

be the ij-th imbalance of P, and let ||P|| = Y^i{i j}c[ m ] ^iji-P) ^ e ^ ne total imbalance 
of P. 

Exercise 4.5. (a) Let (P& : fc £ N) be polynomials in C[x] MA for which there is a 
p £ C[x] such that P k (x, ...,x) = p(x) for all k £ N. If ||P fe || — ► as k —> 0, then 
(P fe : k £ N) converges to a limit P £ C[x] MA , and ||P|| = 0. 

(b) For P £ C[x] MA , ||P|| = if and only if P is invariant under all permutations 
of [to]. Thus, in part (a) the limit is P = Pol m p. 
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Exercise 4.6. Let P E C[x] MA , let {i,j} C [m], and let Q = T^ /2) P. 

(a) Then u>ij(Q) = 0. 

(b) If/ie [m]\{i, j} then u hi {Q) < (uj hi (P) + co hj (P))/2, and similarly for uj hi {Q). 

(c) If {/i, fc} C [m] \ {i, j} then u hk {Q) = uj hk (P). 

(d) Consequently, ||Q|| < ||P|| 

Now we choose the sequence £ = ({ifc,jfc} : fc € N) as follows: for each fc e N, 
{^fciifc} C [to] is any pair of indices {i, j} for which ojij(Fk) attains its maximum 
value. Then uj ik j k (Fk) > (™) so that by Exercise 4.6(d) and induction on 

fceN, 

ll^ill^i-^)" 1 )™^!-^)"^" 1 !™ 

Thus, by Exercise 4.5, Ffc converges to Pol m /, the m-th polarization of /. Finally, 
since each F k is stable (and the limit is a polynomial), Hurwitz's Theorem implies 
that Pol m / is stable. This completes the proof of Lemma 4.2, and hence of Theorem 
4.1. 

5. Polarization arguments and stability preservers. 

For k e W n and a set § C C[x] of polynomials, let §- K be the set of all / e § 
such that degj(/) < k(i) for all i <G [to]. Let 

/(/t) = : i G [m] and j € [«(«)]} 

and let u = {uij : G I(k)} be indeterminates. For / e C[x]- K , Let PolJ^/ 

denote the n{i)-th polarization of Xj m /: this is the image of / under the linear 
transformation Polj^ defined by x\ i— ► ej{ua, for each < j < 

and linear extension. Finally, the re-i/i polarization of f is 

Pol K f = Pol ( 7 } , o ■ ■ ■ o Pol ( V, , f • 

This defines a linear transformation Pol K : C[x]- K — > C[u] MA . 

5.1. The real stability criterion revisited. 

Proposition 5.1. Le£ n € N m and / e C[a;]- K . T/ien Pol K / is stable if and only 
if f is stable. 

Proof. Diagonalization implies that if Pol K / is stable then / is stable, so only the 
converse implication needs proof. Assume that / is stable, and let Zij € 'K for 
€ I{ K )- -By induction on to, repeated application of GWS shows that there 
arc z = (z\, . . . , z m ) e "K™ 1 such that 

PoW(^: (i,i)eJ(«)) = /(z). 
Since / is stable it follows that Pol K / is stable. □ 

If / € M[x]- K then Theorem 3.1 applies to Pol K /. Thus, Proposition 5.1 boot- 
straps the real stability criterion from multiaffinc to arbitrary polynomials. This is 
a typical application of the GWS Theorem. 
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5.2. Linear transformations preserving stability polynomial case. 

Theorem 5.2 (Theorem 1.1 of 5 ]). Let re E W n , and let T : C[x]^ K -> C[x] be a 
linear transformation. Then T maps (5[x]- K into S[x] if and only if either 

(a) T(f) = rj(f) ■ p for some linear functional r\ : C[x]- K — > C and p G 6[x], or 

(b) the polynomial T((x + y) K ) is stable in S[x,y]. 

Proof. Let u = {ttjj : G an d define a linear transformation T : 

C[u] MA -> C[x] as follows. For every A C /(re), define a(A) : [m] — * N by 
putting a(A,i) — \{j E [re(i)] : E A}\ for each i E [to]. Then for each 

A C /(k) define f(u A ) = T(x Q ( A )), and extend this linearly to all of C[u] MA . Let 
A : C[u] MA — > C[x] be the diagonalization operator defined by A(ttjj) = Xi for all 
(i, j) E /(re), extended algebraically. 

Notice that T = ToPol K , and that T = To A. By Proposition 5.1 (and Lemma 
2.4), it follows that T preserves stability if and only if T preserves stability. This 
is equivalent to one of two cases in Theorem 3.4. 

In case (a), if T — p ■ rj for some p E 6[x] and linear functional rj : C[y] MA — > C 
then T — p ■ (rj o Pol K ) is also in case (a). Conversely if T is in case (a) then the 
same is true of T, by construction. 

In case (b), let Pol^ : C[y]- K — > C[v] MA denote the K-th polarization of the y 
variables. The symbols of T and T are related by 

T((u + v)'W) = (T o A)((u + v) 7 ^)) = Pol^T((x + y) K ), 

and Proposition 5.1 shows that T is in case (b) if and only if T is in case (b). □ 

5.3. Linear transformations preserving stability transcendental case. 

Exercise 5.3. Let T : C[x] — > C[x] be a linear transformation. 

(a) Then T : 6[x] -► 6[x] if and only if T : e[x]^ K 6[x] for all re € N m . 

(b) Define 5 : C[x,y] -> C[x,y] by S'(x a y' 3 ) = T(x Q )y' 5 and linear extension. If 
T((x + u) K ) is stable for all re G N m then S((x + u) K (y + v)^) is stable for all 
k, P G N m . 



Let S[x] denote the set of all power series in C[[x]] that arc obtained as the limit 
of a sequence of stable polynomials in <S[x] which converges uniformly on compact 
sets. Theorem 5.4 is an astounding generalization of the Polya-Schur Theorem. For 
a E N m , let a! = UZi^V- 

Theorem 5.4 (Theorem 1.3 of [5]). Let T : C[x] — » C[x] be a linear transformation. 
Then T maps S[x] into 6[x] if and only if either 

(a) T(/) = rj{f) ■ p for some linear functional rj : C[x] — > C and p £ 6[x], or 

(b) </ie power series 



T(c- xy )= (-i) a r(x a ) 



a! 

a:[ml— >N 



6[x,y] 



(Theorem 3.5 has a similar extension - see Theorems 1.2 and 1.4 of [5].) 

For a < /3 in N m , let (/3) Q = f3\/(f3 - a)!, and for a £ (3 let {0) a = 0. 
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Theorem 5.5 (Theorem 5.1 of |5]). Let F(x,y) = X^qgn™ -P Q (x)y Q be a power 
series in C[x][[y]] (so that each P a £ C[x\). Then _F(x,y) is in 6[x,y] if and only 
if for all P € N m , 

^(/3) Q P Q (x)y Q 

is stable in S[x, yj. 

(This implies the analogous result for real stability, since Sr[x] — 6[x] n R[x].) 

Exercise 5.6. Derive Theorem 5.4 from Theorems 5.2 and 5.5. (Hint: T((x + y) K ) 
is stable if and only if T((l — xy) K ) is stable.) 

One direction of Theorem 5.5 is relatively straightforward. 

Lemma 5.7 (Lemma 5.2 of [5]). Fix P G N m . The linear transformation T : y a i— > 
{P)a~y a on C[y] preserves stability. 

Proof. By Theorem 5.2 and Exercise 5.3(a), it suffices to show that for all k £ N m , 
the polynomial T((y + u) K ) is stable. Now 

m 

r((y + u) K ) = n 

i=l 

so it suffices to show that for all k, b £ N, the polynomial f(t) = Sj=oi'(j) (j)^ ^ s 
real stable. Let gr(i) = (1 + d/dt) k t b . One can check that f(t) = t b g(l/t). It thus 
suffices to show that l + d/dt preserves stability. For any a S N, (l + d/<i£)(£ + u) = 
(t + u + a)(t + u) a_1 is stable, and so Theorem 5.2 implies the result. □ 

Now, let F = F(x, y) be as in the statement of Theorem 5.5, and let (F n : n S N) 
be a sequence of stable polynomials F n (x, y) = X^aeN™ ^,a( x )y Q m 6[x,y] con- 
verging to F uniformly on compact sets. Fix P £ N and define a linear transfor- 
mation T : C[x,y] — > C[x,y] by T(x 7 y Q ) = (/3) Q x 7 y Q and linear extension. By 
Lemma 5.7 and Exercise 5.3, T preserves stability in S[x, y]. Thus, (T(F n ) : n £ N) 
is a sequence of stable polynomials converging to T(F). Since T(F) is a polynomial 
the convergence is uniform on compact sets, and so Hurwitz's Theorem implies that 
T(F) is stable. 

The converse direction of Theorem 5.5 is considerably more technical, although 
the idea is simple. With F as in the theorem, for each n > 1 let 

F„(x,y)= (nl) a P a (x)^. 

a<nl 

The sequence (F n : n > 1) converges to F, since for each a £ N m , n~ a (nl) a — > 1 
as n — > oo. Each F n is stable, by hypothesis (and scaling). The hard work is 
involved with showing that the convergence is uniform on compact sets. To do this, 
Borcea and Branden develop a very flexible multivariate generalization of the Szasz 
Principle [5, Theorem 5.6] - in itself an impressive accomplishment. Unfortunately, 
we have no space here to develop this result - see Section 5.2 of [5]. 
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6. Johnson's Conjectures. 



Let A = (Ai, . . . , Ak) be a fc-tuple of n-by-n matrices. Define the mixed deter- 
minant of A to be 



in which the sum is over all ordered sequences of k pairwise disjoint subsets of [n] 
such that [n] = S% U • • • U Sk, and AjfiSJ is the principal submatrix of Ai supported 
on rows and columns in Si. Let Ai(Si) be the complementary principal submatrix 
supported on rows and columns not in Si, and for j £ [n] let Ai(J) — 

For example, when k = 2 and A\ = xl and A 2 = —B, this specializes to 
Det(xl, —B) = det(xl — B), the characteristic polynomial of B. In the late 1980s, 
Johnson made three conjectures about the k = 2 case more generally. 

Johnson's Conjectures. Let A and B be n-by-n matrices, with A positive defi- 
nite and B Hermitian. 

(a) Then T)et(xA, —B) has only real roots. 

(b) For j £ [n], the roots ofT)et(xA(j),—B(j)) interlace those ofDet(xA,—B). 

(c) The inertia of Det(x A, —B) is the same as that of det(x I — B). 

In part (c), the inertia of a univariate real stable polynomial p is the triple 
i(jp) — lq(p), l+(p)) with entries the number of negative, zero, or positive 

roots of p, respectively. 

In 2008, Borcea and Branden [T] proved all three of these statements in much 
greater generality. 

Theorem 6.1 (Theorem 2.6 of PQ). Fix integers £,m,n > 1. For h € [£} and 

i £ [rn] let Bh and A^ be n-by-n matrices, and let 



(a) If all the A^i are positive semidefmite and all the Bh are Hermitian, then 
Det(X) = Det(ii, . . . , Lf) £ 6r[x] is real stable. 

(b) For each j £ [n], let = (Li(j), . . . , Li(j)). With the hypotheses of part (a), 
the polynomial Det(iL) + t/Det(<C(j)) £ 6r[x, y] is real stable. 

Proof. Let Y — diag(yi, y n ) be a diagonal matrix of indeterminates. By Propo- 
sition 2.1, for each h £ [£] the polynomial 



Det(A) = Bet(A 1 ,...,A k ) = ^ n det ^^' 

(S u ...,S k )i=l 



rn 




det(Y + L h ) = Y S det L h (S) 



SC[n] 



is real stable in 6t[x,y], By inversion of all the y indeterminates, each 



det(I-yL h )= (-l) |S| y S detL h [S] 



SC[n] 
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is real stable. Since Il?i=i det(I-YLh) is real stable, contraction and specialization 
imply that 



Det(£) = (-ir 5y -^-ndet(I-^) 



y=o 



is real stable, proving part (a). 

For part (b), let V be the n-by-n matrix with all entries zero except for Vjj = y. 
By part (a) , 

Det(V, L u L h ) = Det(£) + yDet(£(j)) 
is real stable. □ 

Theorem 6.1 (with Corollary 2.10) clearly settles Conjectures (a) and (b). 

Proof of Conjecture (c). Let A and B be n-hy-n matrices with A positive definite 
and B Hermitian. Let (t_,t ,t + ) be the inertia of det(a;7 — B). Let f(x) = 
Det(xA, —B), and let v + ) be the inertia of /. 

We begin by showing that v§ = to- Since to = min{|S'| : S C [n] and det (£?(£)) ^ 
0}, it follows that vq > lq. The constant term of f(x) is (— l) n det(-B), so that if 
to = then v Q = 0. If t = k > then let S = {si, ...,Sfc} C [n] be such that 
det(B(S)) ^ 0. For < i < k let f t (x) = Det(A({si, .., s<}), -B({sx, .., s;})), so 
that /o(a;) = /(^)- By Theorem 6.1, the roots of and of are interlaced, for 
each i £ [k]. Thus, 

vo = to(/o) < to(/i) + 1 < io(/ 2 ) + 2 < • • ■ < to(/ fc ) + fc = k = t , 

since to(/fc) = because det(J3(S)) 7^ 0. Therefore v = t . 

For any positive definite matrix A, Det(x^4, —B) is a polynomial of degree n. 
Suppose that A is such a matrix for which v + ^ t + . Consider the matrices A\ — 
(1 — X)I + XA for A 6 [0, 1]. Each of these matrices is positive definite. From the 
paragraph above, each of the polynomials g\(x) — Det(xA\,~ B) has Lo{g\) = to- 
Since t+(<?o) = t + ^ v + = L+(g%) and the roots of g\ vary continuously with A, 
there is some value n £ (0,1) for which to(g^) > lq. This contradiction shows that 
u + = t + , and hence v_ = t_ as well. □ 

Borcea and Branden pQ proceed to derive many inequalities for the principal 
minors of positive scmidefinite matrices, and some for merely Hermitian matrices. 
These are applications of inequalities valid more generally for real stable polynomi- 
als. The simplest of these inequalities are as follows. 

For an n-hy-n matrix A, the j-th symmetrized Fisher product is 

*j(A)= ]T det(A[S])det(A(SO). 

SC[n]: \S\=j 

and the j-th averaged Fisher product is <Tj(A) — (™) <Tj(A). Notice that <Jj(A) — 
cr n -j(A) for all < j < n. 

Corollary 6.2. Let A be an n-by-n positive semidefinite matrix. 

(a) Then d^A) 2 > a j - 1 (A)a j+1 (A) for all 1 < j < n - 1. 

(b) Also, a (A) < ax{A) <■■■< d [n/2l . 

(c) If A is positive definite and det(A) — d then 

&i(A) > (MA)Y /2 > (h^Y /3 > ... > f^(A) xl/ ' 
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Proof. It suffices to consider positive definite A. By Theorem 6.1, the polynomial 
T)et(xA, —A) — 53j=o( — l) Jcr j {A)x : > has only real roots, and these roots are all 
positive. Part (a) follows from Newton's Inequalities pJl Theorem 51]. Part (a) 
and the symmetry <7j(A) — a n -j(A) for all < j < n imply part (b). Part (c) 
follows from Maclaurin's Inequalities [TH Theorem 52]. □ 

7. The symmetric exclusion process. 

This section summarizes an application of stable polynomials to probability and 
statistical mechanics from a 2009 paper of Borcea, Branden and Liggett [7J. 

Let A be a set of sites. A symmetric exclusion process (SEP) is a type of Markov 
chain with state space a subset of {0, 1} A . In a state S : A — > {0, 1}, the sites in 
5 _1 (1) are occupied and the sites in 5' _1 (0) are vacant. This is meant to model 
a physical system of particles interacting by means of hard-core exclusions. Such 
models come in many varieties - to avoid technicalities we discuss only the case 
of a finite system A and continuous time t. (The results of this section extend to 
countable A under a reasonable finitencss condition on the interaction rates.) Sym- 
metry of the interactions turns out to be crucial, but particle number conservation 
is unimportant. 

Let E be a set of two-element subsets of A. For each {i, j} e E, let Ay > be 
a positive real, and let ry : A — > A be the permutation that exchanges i and j and 
fixes all other sites. Our SEP Markov chain M proceeds as follows. Each {i,j} € E 
has a Poisson process "clock" of rate Ay, and these are independent of one another. 
With probability one, no two clocks ever ring at the same time. When the clock 
of {i,j} rings, the current state S is updated to the new state S o Tij. In other 
words, when the clock rings, if exactly one of the sites {i,j} is occupied then 

a particle hops from the occupied to the vacant of these two sites. 

Let A = [m] and ft — {0, 1} A , let (fio be an initial probability distribution on J7, 
and let tpt be the distribution of the state of M, starting at tpg, after evolving for 
time t > 0. We are concerned with properties of the distribution ip t that hold for 
all t > 0. 

7.1. Negative correlation and negative association. Consider a probability 
distribution (p on f2. An event £ is any subset of fi. The probability of the event 
£ is Pr[£] = X)se£ fi^)' ^ n even t £ is increasing if whenever S < <S" in f2 and 

5 € £, then S' G £. For example, if K is any subset of A and £^ is the event that 
all sites in K are occupied, then Zk is an increasing event. Notice that this event 
has the form £^ = £' x {0, 1} A ^ K for some event £' C {0,1} K . Two events £ 
and $ are disjointly supported when one can partition A = A U B with A D B = 
and £ = £' x {0,1} B and 9 = {0,1} A x J' for some events £' C {0, 1} A and 

6 C {0,1} B . 

A probability distribution on Q is negatively associated (NA) when Pr[£ n !f] < 
Pr[£] • Pr[3 r ] for any two increasing events that are disjointly supported. It is 
negatively correlated (NC) when Pr[£{ ij j] < Pr[£{jj] • Pr[£{j}] for any two distinct 
sites {i,j} C A. Clearly NA implies NC. 

It is useful to find conditions under which NC implies NA, since NC is so much 
easier to check. The following originates with Feder and Mihail, but many others 
have contributed their insights - see Section 4.2 of [7]. The partition function of 
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any ip : f2 — > M. is the real multiaffine polynomial 

Z{<p)=Z(wy) = $>(S)x s 
sen 

in M[x] MA . If ip is nonzero and nonnegative, then for any a G M A with a > 0, this de- 
fines a probability distribution c,c a : O — >• [0,1] by setting <£> a (S) = ip(S)a s /Z(ip; a) 
for all Seft. 

Feder-Mihail Theorem (Theorem 4.8 of [7]). Lei § be a class of nonzero non- 
negative functions satisfying the following conditions. 

(i) Each ip G S ftas domain {0, 1} A /or some finite set A = A(y>). 

(ii) Lor eacft «s a homogeneous polynomial. 

(iii) for each <p G § and i £ A(<p), Z((/?)| a;i= o diZ{tp) are partition functions of 
members of S. 

(iv) For each ip G S and a e M A(v,) luftft a > 0, ip & is NC. 
Then for every ip G S and a G M A ^^ a > 0, <^ a is AT.A. 

7.2. A conjecture of Liggett and Pemantle. In the early 2000s, Liggett and 
Pemantle arrived independently at the following conjecture, now a theorem. 

Theorem 7.1 (Theorem 5.2 of [7]). If the initial distribution ipo of a SEP is 

deterministic (i.e. concentrated on a single state) then ip t is NA for all t > 0. 

Proof. This amounts to finding a class S of probability distributions such that: 

(1) deterministic distributions are in 6, 

(2) being in 6 implies NA, and 

(3) time evolution of the SEP preserves membership in 6. 

Borcea, Branden, and Liggett [7J identified such a class: ip is in & if and only if 
the partition function Z(ip) is homogeneous, multiaffine, and real stable. (Notice 
that if <p is in 6 then ip & is in 6 for all a G K A with a > 0, by scaling.) We proceed 
to check the three claims above. 

Claim (1) is trivial, since if <p(S) = 1 then Z(ip) = x , which is clearly homoge- 
neous, multiaffine, and real stable. 

To check claim (2) we verify the hypotheses of the Feder-Mihail Theorem. Hy- 
potheses (i) and (ii) hold since Z(ip) is multiaffine and homogeneous. By special- 
ization and contraction, (iii) holds. To check (iv), let a G M A with a > 0, let 
{hi} Q A, and consider the probability distribution ip a on f2. The occupation 
probability for site i is 

[{i}]= 4- Z^ap^^ap 
se{o,i} A : s(i)=i vr ' v ^ ; 

and similarly for Pr[£/j\]. Likewise, Pr[£/jj}] = aiajZ(ip; a) -1 • didjZ(ip;a). Now 

Pr[£ {iJ} ]-Pr[£ {i} ]-Pr[£ w ] = - ^ A^Z^a) < 0, 

by Theorem 3.1. Thus ip a is NC. By the Feder-Mihail Theorem, every ip in 6 is 
NA. 

To check claim (3) we need some of the theory of continuous time Markov chains. 
The time evolution of a Markov chain M with finite state space f2 is governed by 
a one-parameter semigroup T(t) of transformations of MP. For a function F G MP 
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and time t > and state S £ il, (T(t)F)(S) is the expected value of F at time t, 
given that the initial distribution of M is concentrated at S with probability one 
at time 0. In particular, ip t — T(t)<f for all t > 0, and all initial distributions 
ipo- In the case of the SEP we are considering, the infinitesimal generator £ of the 
semigroup T(t) is given by 

{i,j}eE 

For each {i,j} £ E, this replaces each S £ ft by 5 o t^- at the rate Ajj. 

In preparation for Section 7.3, it is useful to regard £ as an element of the real 
semigroup algebra 21 = M.[<E] of the semigroup <£ of all endofunctions f : — > O (with 
the operation of functional composition). The left action of (E on Q is extended to 
a left action of 21 on C[x] as usual: for f e £ and S £ CI, f(x s ) = x^ s \ extended 
bilincarly to all of 21 and C[x]. A permutation a £ 8(A) is identified with the 
endofunction f a : S go u~ x , so this action of 21 agrees with the action of S(m) in 
Section 4.2. A left action of 21 on R Q is defined by Z(f(F)) = f(Z(F)) for all f e € 
and F e ]R n , and linear extension. More explicitly, for f £ €, F £ M. n , and S £ CI, 

(f(F))(S) = F(r 1 (S))=J2{F(S'): S' G and f(S') = S}. 

Consider an element of 21 of the form L = J2iLi ~ 1) with all \ > 0. Let 
Xi < L for all i £ [A], and let A = J2iLi The power series 

= E p f(*W 

fee 

in 2t[[t]] is such that for each f G <£, Pf(i) G K[[t]] is dominated coefficientwise by 
exp((LA — A)t). Thus exp(i£) G 2l[[i]] converges for all t > 0. The semigroup of 
transformations generated by £ is exp(t£). 

To check claim (3) we will show that the semigroup T(t) of the SEP preserves 
stability for all t > 0: that is, if Z(ip ) is stable then Z(<p t ) = T(t)Z(ip ) is stable 
for all t > 0. This reduces to the case of a single pair {i, j} £ E, as follows. If Mi 
and M2 are Markov chains on the same finite state space, with semigroups Ti(t) 
and T 2 (t) generated by £1 and £2, then the semigroup generated by £-1 + £2 is 

T(t)= lim [Ti(t/n)T 2 (t/n)] n , 

71— >00 

by the Trotter product formula. By Hurwitz's Theorem, It follows that if Ti(t) 
preserves stability for all t > and i £ {1,2}, then T(t) preserves stability for all 
t > 0. By repeated application of this argument, in order to show that the SEP 
semigroup T(t) = cxp(i£) preserves stability for all t > it is enough to show that 
for each {i,j} £ E, Tij(t) — ex.p(t\ij(rij — 1)) preserves stability for all t > 0. Now, 
since r?- = 1, 

/l + e - 2 ^*\ /i_ e - 2A «*\ 
T «(*)=(— 2 ) + (~ j' T - 

By Lemma 4.3, this preserves stability for all t>0. This proves Theorem 7.1. □ 



exp(£C) = c 



-Kt 



°° in 



n=0 



N 
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7.3. Further observations. In verifying the hypotheses of the Feder-Mihail The- 
orem we used the fact that if / e ©r[x] ma is multiaffine and real stable, then 
A i3 /(a) > for all {i,j} C E and a G R m , by Theorem 3.1. In fact, we only 
needed the weaker hypothesis that Ay/(a) > for all {i,j} C E and a € R m 
with a > 0. A multiaffine real polynomial satisfying this weaker condition is a 
Rayleigh polynomial. (This terminology is by analogy with the Rayleigh mono- 
tonicity property of electrical networks - see Definition 2.5 of [7] and the references 
cited there. Multiaffine real stable polynomials are also called strongly Rayleigh.) 
The class of probability distributions if such that Z((p) is homogeneous, multiaffine, 
and Rayleigh meets all the conditions of the Feder-Mihail Theorem. It follows that 
all such distributions are NA. 

Claim (2) above can be generalized in another way - the hypothesis of ho- 
mogeneity can be removed, as follows. Let y = (yi, . . . ,y m ) and let e,-(y) be 
the j-th elementary symmetric function of the y. Given a multiaffine polyno- 
mial / = X)sc[m] c (S)x s ', the symmetric homogenization of / is the polynomial 
/sh(x,y) GC[x, y ] MA defined by 

/ah(x,y)= ]T <S)x?(!2) e mHS |(y). 

SC[m] ^ 

Note that / s h is homogeneous of degree m, and / s h(x, 1) = /(x). 

Proposition 7.2 (Theorem 4.2 of |7J. If f E 6r[x] ma is multiaffine and real 
stable then / s h € ©R[x,y] MA is homogeneous, multiaffine and real stable. 

(We omit the proof.) 

Corollary 7.3 (Theorem 4.9 of [7 ). Let <p : Q — ► [0,oo) be such that Z(ip) is 
nonzero, multiaffine, and real stable. Then for all a £ K m with a > 0, tp a is NA. 

Proof. By Proposition 7.2, Z s -^(ip;x,y) is nonzero, homogeneous, multiaffine, and 
real stable. This is the partition function for ip : {0, l}[ 2m l — > [0,oo) given by 
^( S ) = (|sn[m]|)"V(5 n [m]). By claim (2) above, V a is NA for all a e R 2m with 
a > 0. By considering those a € K 2m for which a, = 1 for all m + 1 < i < 2m, it 
follows that ip a is NA for all aeR m with a > 0. □ 

Corollary 7.4 (Theorem 5.2 of [7])- If the initial distribution <po of a SEP is such 
that Z(ip) is stable (but not necessarily homogeneous), then Z(ip t ) is stable, and 
hence tft is NA, for all t > 0. 

It is natural to try extending these results to asymmetric exclusion processes. 
For e A 2 define Uj € £ by Uj(S) = So Tij if S(i) = 1 and S(j) = 0, and 

Uj(S) = S otherwise, for all S £ f2. That is, Uj makes a particle hop from site i 
to site j, if possible. Let E be a set of ordered pairs in A 2 , and for G E let 
Xij > 0. An asymmetric exclusion process is a Markov chain on f2 with semigroup 
T(t) = exp(tiL) generated by something of the form 

By the argument for claim (3) above, in order to show that T(t) preserves sta- 
bility for all t > 0, it suffices to do so for the two-site semigroup X!n,2}(*) = 
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exp(tJ£j/i 2}) generated by 

^{1,2} = Al2(tl2 — 1) + A2l(t21 — 1)- 

Exercise 7.5 (Strengthening Remark 5.3 of [7]). With the notation above, let 
A = A12 + A 2 i, /3i 2 = A12/A, and = A 2 i/A. 

(a) In 21, tia+fai = l + r 12 . 

(b) If lo is any word in {ti 2 , l 2 \} n , then i\ 2 uj = ti2 and t 2 iw = t 2 i. 

(c) The semigroup generated by £{1.2} is 

r {1 , 2} (t) = c- xt + (1 - e~ xt )(f3 12 t 12 + p 21 t 21 ) 

(d) The semigroup T{ 12 }(i) preserves stability for all t > if and only if /3i 2 = 
/3 2 i = 1/2, in which case it reduces to the SEP (of rate A/2). 

Thus, even the slightest asymmetry ruins preservation of stability by the SEP! 

Finally, we consider a SEP in which particle number is not conserved. For i e A 
define a,, a* € £ as follows: for S e and j e A, let (0i(5))(j) = « (S))(j) = 
if j 7^ i, and (dj(S))(i) = and (a*(S))(i) = 1. That is, a, annihilates a particle at 
site i, and a* creates a particle at site i, if possible. 

A SEP with particle creation and annihilation is a Markov chain on f2 with 
semigroup Tit) = exp(i£) generated by something of the form 

in which the first sum is the generator of the SEP in Theorem 7.1 and 6*.;,(9* > 
for each i 6 A. 

By the argument for claim (3) above, to show that this T(t) preserves stability for 
all t > 0, it suffices to do so for the one-site semigroups generated by £1 = 0(di — 1) 
and £f = 0(a5; — 1), respectively. 

Exercise 7.6. The semigroups generated by L\ and L\ are 

T 1 (t) = e- 9t + (l-e- et )ai and = e~ et + (1 - e~ et )al , 

respectively. Both T\(t) and T*(t) preserve stability. 

Corollary 7.7. If the initial distribution (fo of a SEP with particle creation and 
annihilation is such that Z(ip) is stable, then Z(tp t ) is stable, and hence ip t is NA, 
for all t > 0. 

8. Inequalities for mixed discriminants. 

This section summarizes a powerful application of stable polynomials from a 
2008 paper of Gurvits QT]. 

We will use without mention the facts that log and exp are strictly increasing 
functions on (0, 00). A function p : I — > K defined on an interval / C R is convex 
provided that for all a\,a 2 e /, p((a\ + a 2 )/2) < (p(cti) + p(a 2 ))/2. It is strictly 
convex if it is convex and equality holds here only when a\ = a 2 . A function 
p : I — > R is (strictly) concave if —p is (strictly) convex. For example, for positive 
reals a±, a 2 > one has {y/a\ — sfa^) 2 > 0, with equality only if ai = a 2 . It follows 
that log((ai + a 2 )/2) > (log(ai) + log(o 2 ))/2, with equality only if a\ — a 2 . That 
is, log is strictly concave. 
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Jensen's Inequality (Theorem 90 of [12). Let p : I — > K be defined on an interval 
ICR, let a,i £ I for i £ [n], and let 6j > /or i G [n] &e such that X)"=i = 1- -tf 
p is convex then 




n 



P I /^ b i a i < 2_^bip(ai). 

-1 / i=i 

// p is strictly convex and equality holds, then ax = &2 = • • • = a n . 

For integer d > 1, let G(d) = (1 - l/d) d ~\ and let G(0) = 1. Note that 
G(l) = 0° = 1, and that G(d) is a strictly decreasing function for d > 1. For 
homogeneous / £ K[x] with nonnegative coefficients, define the capacity of / to be 

cap(/) = inf ^ , 

c>0 a ■ ■ ■ C m 

with the infimum over the set of all c £ K m with Cj > for all i £ [m] . 

Lemma 8.1 (Lemma 3.2 of Let f = X)<=o ^ ia; ^ be a nonz ^ ro univariate 

polynomial of degree d with nonnegative coefficients. Lf f is real stable then b\ — 
/'(0) > G(<i)cap(/), and i/cap(/) > then equality holds if and only if d < 1 or 
f(x) = b d (x + £) d for some £ > 0. 

Proof. If cap(/) = then there is nothing to prove, so assume that cap(/) > 0. 
If d = then /'(0) = 6i = = G(0)cap(/), and if d = 1 then /'(0) = bi = 
G(l)cap(/), so assume that d > 2. If /(0) = then /'(0) = lim c ^ f(c)/c > 
cap(/) > G(d)cap(/). Thus, assume that d > 2 and /(0) = b > 0. We may rescale 
the polynomial so that 6o = 1. Now there are a% > for i S [d] such that 

,/ 

f( X ) = JJ(1 + Oil), 

t=i 

and 6i = ai + • • • + a f j. For any c > we have 

d 



log(cap(/)c) log(/(c)) 



= i ^ log(l + a lC ) < log (l + ^r) , 



by Jensen's Inequality. It follows that cap(/)c < (1 + b\cjd) d for all c > 0. Let 
g(x) = (1 + b 1 x/d) d . Elementary calculus shows that 

/ \ . , g(c) g(c*) h . d 

cap(g) = mi = = — — — - , m wnicn — 



oo c c* G(d)' * i>i(d— 1)' 

Since cap(/) < cap(g), this yields the stated inequality. If equality holds, then 
equality holds in the application of Jensen's Inequality, and so / has the stated 
form. □ 

Lemma 8.2 (Theorem 4.10 of [H]). Let f £ 6r[xi, x m ] be real stable, with 
nonnegative coefficients, and homogeneous of degree m. Let g = <9 m /U m =o- Then 

cap(^) > G(deg m (d))cap(/). 

Proof. We may assume that d = deg m (/) > 1. Let c, > for i £ [m — 1], and let 
Vc{x) = f(c%, ...,c m -i,x). Since / has nonnegative coefficients, p c ^ 0. As in the 
proof of Lemma 2.4(f), p c has degree d. By specialization, p c is real stable. Lemma 
8.1 implies that 

g(c) =p' c (0) > G(d)cap(p c ) > G(d)cap(/) 
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for all c € with c > 0. If m = 1 then g = cap(g) is a constant. If to > 2 

then for any such c let b = (ci • • • c m -i)~ 1 /( rn ~ 1 \ Since g is homogeneous of degree 
to — 1, 

— ^ = g(bc lt 6c m _i) > G(d)cap(/). 

ci • • ■ c m _i 

It follows that cap(g) > G(<i)cap(/). □ 

Theorem 8.3 (Theorem 2.4 of [H]). Let f £ ©]g[a;i, a; TO ] &e reaZ stable, with 
nonnegative coefficients, and homogeneous of degree m. Let deg^/) = <i, and 
Gj = min{z, d;} for each i G [to]. Then 



a 1 /(0)>cap(/)nG(e J ). 



i=2 

Proof. Let <7 m = / and let gi-i = digi\ Xi= o for all i £ [to]. By contraction and 
specialization, ^ is real stable for each i € [to]. Notice that g = d 1 f(0) = 
cap(go)- By Lemma 8.2. cap(gi_i) > cap(#i) • G(deg i gi) for each i € [to]. But 
deg 4 < deg^ f = d t , and deg^ (^ is at most the total degree of gi, which is i. Hence 
deg 4 gi < et, and thus G(deg i gi) > G(ei). Thus cap(gi_i) > cap(gi) • G(ei) for each 
i € [m]. Combining these inequalities (and G(ei) — 1) gives the result. □ 

With the notation of Theorem 8.3, since < i for all i € [to] and G(c?) is a 
decreasing function of d, one has the inequality 



nG( e ,)>ri G «=n 



1 \ to! 



TO/' 



i=2 i=2 i=2 

Thus, the following corollary is immediate. 

Corollary 8.4. Let f G 6 K [ xi,...,x m ] &e reaZ stable, with nonnegative coefficients, 
and homogeneous of degree to. TTien 



OV(0)>£^-cap(/). 



Theorem 8.5 (Theorem 5.7 of [H]). Let f £ 6i[ii,...,i m ] &e reaZ stable, with 
nonnegative coefficients, and homogeneous of degree to. Equality holds in the bound 
of Corollary 8.4 if and only if there are nonnegative reals > for i € [m] such 
that 

/(x) = (ai^i H h a m a; m ) m . 

(We omit the proof.) 

Lemma 8.6 (Fact 2.2 of [TT]). Lei / € K[xi, ar m ] be homogeneous of degree 
to, with nonnegative coefficients. Assume that dif(l) = 1 for all i G [to]. Then 
cap(/) = 1. 

Proof. Let / = ^( a ) x " > so that if 6(a) ^ then |a| = X^I=i a W = m - By 
hypothesis, for all i <E [to], ^ q 6(a)a(i) = 1. Averaging these over all i <G [to] yields 
/(l) = £a fe O) = 1; so that cap(/) < 1. Conversely, let c € E m with c > 0. 
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Jensen's Inequality implies that 



log(/(c)) = log [J2 Ka)C 



a 



) 



> 



b ( a ) lo g( cQ ) = log(cj) Y b{a)a{i) = log(ci • • • c m ). 




It follows that cap(/) > 1. 



□ 



Example 8.7 (van der Waerden Conjecture). An m-by-m matrix A — (ay) is 
doubly stochastic if all entries are nonnegative reals and every row and column 
sums to one. In 1926, van der Waerden conjectured that if A is an m-by-m doubly 
stochastic matrix then per(A) > ml/m m , with equality if and only HA— (1/m) J, 
the m-by-m matrix in which every entry is 1/m. In 1981 this lower bound was 
proved by Falikman, and the characterization of equality was proved by Egorychev. 
These results follow immediately from Corollary 8.4 and Theorem 8.5, as follows. 
It suffices to prove the result for an m-by-m doubly stochastic matrix A = (ay) 
with no zero entries, by a routine limit argument. The polynomial 



is clearly homogeneous and real stable, with nonnegative coefficients and of degree 
to, and such that deg^/^) = m for all i £ [to]. Since A is doubly stochastic, Lemma 
8.6 implies that cap (/a) = 1. Since 



Corollary 8.4 and Theorem 8.5 imply the results of Falikman and Egorychev, re- 
spectively. Gurvits |llj also uses a similar argument to prove a refinement of the 
van der Waerden conjecture due to Schrijver and Valiant - see also |13j . 

Given n-by-n matrices Ai,...,A m , the mixed discriminant of A — (Ai, A m ) is 



This generalizes the permanent of an m-by-m matrix B — (6y) by considering the 
collection of matrices A(B) — (A\, ,.,,A m ) defined by Ah = diag(a/ji, a/j m ) for 
each h £ [m] . In this case one sees that 



with the notation of Example 8.7, and it follows that Disc(A(_B)) = per(£?). 

Example 8.8 (Bapat's Conjecture). Generalizing the van der Waerden conjec- 
ture, in 1989 Bapat considered the set fi(m) of m-tuples of m-by-m matrices 
A = [A\, A m ) such that each Ai is positive semidennite with trace tr(Ai) = 1, 
and Y^iLi A-i = I. For any doubly stochastic matrix B, A(B) is in this set. The 
natural conjecture is that for all A £ f2(m), Disc(A) > m!/m m , and equality is 
attained if and only if A = A((l/m)J). This was proved by Gurvits in 2006 - 
again, it follows directly from Corollary 8.4 and Theorem 8.5. It suffices to prove 
the result for A € f2(m) such that each Ai is positive definite, by a routine limit 
argument. By Proposition 2.1, for A £ fi(m), the polynomial 



/a(x) = (oyafi H 1- a mj x m ) 



per(A) = OVa(O), 



Disc(A) = d 1 det(xi^4i H h x m A m ) 



x=0 ' 



det(xiAi H h x m A m ) = /b(x) 



/a(x) = det(xiA 1 H h x m A, 



-in 
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is real stable. Since each Ai is positive definite, all coefficients of /a are nonnegative, 
/a is homogeneous of degree m, and deg^/^) = m for all i £ [to]. Since A £ Q(m), 
Lemma 8.6 implies that cap(/A) = 1. Thus, /a satisfies the hypothesis of Theorems 
8.3 and 8.5, and since Disc(A) = 9 1 /a(0), the result follows. 

9. Further Directions. 

9.1. Other circular regions. Let f2 C C m . A polynomial / <E C[x] is 0,-stable if 
either / = identically, or /(z) ^ for all z £ £1. At this level of generality little 
can be said. If £1 = A\ x • • • x A m is a product of open circular regions then there 
are Mobius transformations z i— > 4>i{z) — (fiiZ + b{) / (ciZ + dA such that <fo(9£) = A; 
for all i £ [to]. The argument in Section 4.1 shows that / £ C[x] is fi-stable if and 
only if 

/ = (CZ + d) do ^ ■ f(Mzi), ^W) 

is stable. In this way results about stable polynomials can be translated into results 
about f2-stable polynomials for any f2 that is a product of open circular regions. 

Theorem 6.3 of [5] is the fi-stability analogue of Theorem 5.2. We mention only 
two consequences of this. Let D — {z £ C : \z\ < 1} be the open unit disc, 
and for 6 £ R let % e = {e~ w z : z £ 'K}. Thus J£ = K, and M l/2 is the open 
right half-plane. A D m -stable polynomial is called Schur stable, and a J£^ 2 -stable 
polynomial is called Hurwitz stable. 

Proposition 9.1 (Remark 6.1 of 5 .). Fix k £ N m , and let T : C[x]^ K C[x] be 
a linear transformation. The following are equivalent: 

(a) T preserves Schur stability. 

(b) T((l + xy) K ) is Schur stable in C[x,y]. 

Proposition 9.2 (Remark 6.1 of 0.). Fix k £ N m , and let T : C[x]^ K -> C[x] be 
a linear transformation. The following are equivalent: 

(a) T preserves Hurwitz stability. 

(b) T((l + xy) K ) is Hurwitz stable in C[x, y]. 

9.2. Applications of Theorem 5.4. It is natural to consider a multivariate 
anaogue of the multiplier sequences studied by Polya and Schur. Let A : N" 1 — * R, 
and define a linear transformation T\ : C[x] — > C[x] by T\(x a ) = A(a)x Q for all 
a € N m , and linear extension. For which A does T\ preserve real stability? The 
answer: just the ones you get from the Polya-Schur Theorem, and no more. 

Theorem 9.3 (Theorem 1.8 of [4 .). Let A : N m — * R. Then T\ preserves real 
stability if and only if there are univariate multiplier sequences Aj : N — > R for 
i £ [m] and e £ {— 1, +1} such that 

A(a) = Ai(a(l)) • • • A m (a(m)) 

for all a £ N m , and either e l Q lA(a) > for all a £ N m , or e^X(a) < for all 
a £ N m . 

Theorem 5.4 (and similarly Propositions 9.1 and 9.2) can be used to derive a 
wide variety of results of the form: such-and-such an operation preserves stability 
(or Schur or Hurwitz stability). Here is a short account of Hinkkanen's proof of the 
Lee- Yang Circle Theorem, taken from Section 8 of [B]. 
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For f,g e C[x] MA , say / = £ S c [ro ] a(5)x s and <? = £ sgH K^)x S , let 

SC[m] 

be the Schur-Hadamard product of / and g. 

Theorem 9.4 (Hinkkanen, Theorem 8.5 of [5]). If f,g G C[x] MA are Schur stable 
then f • g is Schur stable. 

Proof. Let T g : C[x] MA -> C[x] MA be defined by />->/• g. By Proposition 9.1, 
to show that T g preserves Schur stability it suffices to show that T g ((l + xy)[ m l) 
is Schur stable. Clearly T g ((l + xy)[™l) = g{x\y\, ...,x m y m ) is Schur stable since 
<?(x) is. Hence T g preserves Schur stability, and so / • g is Schur stable. □ 

Theorem 9.5 (Lee- Yang Circle Theorem, Theorem 8.4 of [5]). Let A = (aij) be a 
Hermitian m-by-m matrix with \a,ij\ < 1 for all i, j S [m]. Then the polynomial 

/(*)= e * s nrN 

SC[ra] i£S j£S 

is Schur stable. The diagonalization g(x) = f(x,...,x) is such that x m g(\/x) — 
g(x), and it follows that all roots of g(x) are on the unit circle. 

Proof. For i < j in [m] let 

fij = (1 + dijXi +OijXj + XiXj) Yl (l + ^fc.)' 

One can check that each is Schur stable. The polynomial /(x) is the Schur- 
Hadamard product of all the /y for C [m]. By Theorem 9.4, /(x) is Schur 
stable. □ 

Section 8 of [6] contains many many more results of this nature. 

9.3. A converse to the Grace-Walsh- Szego Theorem. The argument of Sec- 
tions 4.2 and 4.3 can be used to prove the following. 

Exercise 9.6. If / € S[x] MA is multiaffine and stable then 

is multiaffine and stable. 

This is in fact equivalent to the GWS Theorem, since for all / 6 C[x] MA , 
Tg( m )/(x) = Pol m /(a;, . . . , x). For which transitive permutation groups G < S(m) 
does the linear transformation Tq — |G| _1 J2aea a P reserve stability? The answer: 
not many, and they give nothing new. 

Theorem 9.7 (Theorem 6 of [9].). Let G < §(m) be a transitive permutation group 
such that Tg preserves stability. Then Tq = T§t m \. 
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9.4. Phase and support theorems. A polynomial / E C[x] has definite parity if 
every monomial x Q occurring in / has total degree of the same parity: all are even, 
or all are odd. 

Theorem 9.8 (Theorem 6.2 of [TO]). Let f E C[x] be Hurwitz stable and with 
definite parity. Then there is a phase < 8 < 2ir such that e /(x) has only real 
nonnegative coefficients. 

The support of / = c(a)x a is supp(/) = {a E N m : c(a) ^ 0}. Let 
Si denote the unit vector with a one in the i-th coordinate, and for a E Z" let 
|ct| — J2"Li \ a (i)\- A jump system is a subset 3 C Z m satisfying the following two- 
step axiom: 

(J) If a, (3 E 3 and i E [m] and e E { — 1,+1} are such that a' — a + eSi satisfies 
\a' — j3\ < \a — /3\, then either a' E d or there exists j E [m] and e E {—1, +1} such 
that a" = a' + e5j E 3 and \a" - j3\ < \a' - /3|. 

Jump systems generalize some more familiar combinatorial objects. A jump 
system contained in {0, l} m is a delta-matroid. A delta-matroid 3 for which \a\ is 
constant for all a E 3 is the set of bases of a matroid. For bases of matroids, the 
two-step axiom (J) reduces to the basis exchange axiom familiar from linear algebra: 
if A, B E 3 and a E A \ B, then there exists b E B \ A such that (A \ {a}) U {b} 
is in 3- 

Theorem 9.9 (Theorem 3.2 of [S]). If f E S[x] is stable then the support supp(/) 
is a jump system. 

Recall from Section 7 that for multiaffine polynomials with nonnegative coeffi- 
cients, real stability implies the Rayleigh property. A set system 3 is convex when 
A, B E 3 and A C B imply that C E 3 for all A C C C B. 

Theorem 9.10 (Section 4 of [H]). Let f — J2sc[ m ] c (S) xS be multiaffine with real 
nonnegative coefficients, and assume that f is Rayleigh. 

(a) The support supp(/) is a convex delta-matroid. 

(b) The coefficients are log-submodular: for all A, B C [m], 

c{A n B)c(A UB)< c{A)c{B). 
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